A fast coarse filtering method for peptide identification by mass spectrometry

Authors

  • Smriti R. Ramakrishnan
  • Rui Mao
  • Aleksey A. Nakorchevskiy
  • John T. Prince
  • Willard S. Willard
  • Weijia Xu
  • Edward M. Marcotte
  • Daniel P. Miranker
Abstract

MOTIVATION: We reformulate the problem of comparing mass spectra by mapping spectra to a vector space model. Our search method leverages a metric space indexing algorithm to produce an initial candidate set, which can be followed by any fine ranking scheme.

RESULTS: We consider three distance measures integrated into a multi-vantage point index structure. Of these, a semi-metric fuzzy cosine distance using peptide precursor mass constraints performs best. The index acts as a coarse, lossless filter with respect to the SEQUEST and ProFound scoring schemes, reducing the number of distance computations and the number of candidates returned for fine filtering to about 0.5% and 0.02% of the database, respectively. The fuzzy cosine distance term improves specificity over a peptide precursor mass filter alone, reducing the number of returned candidates by an order of magnitude. Run time measurements suggest proportional speedups in overall search times. Using an implementation of ProFound's Bayesian score as an example of a fine filter on a test set of Escherichia coli protein fragmentation spectra, the top results of our sample system are consistent with those of SEQUEST.
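The coarse filtering idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact definition of the fuzzy cosine distance, so everything below is an assumption made for illustration: spectra are treated as binary peak vectors, two peaks count as shared when their m/z values differ by at most a hypothetical tolerance `tol`, and the distance is the angle between the vectors. A plain linear scan stands in for the multi-vantage point index, and the names `fuzzy_cosine_distance`, `coarse_filter`, `mass_tol`, and `radius` are hypothetical.

```python
import math


def fuzzy_cosine_distance(peaks_a, peaks_b, tol=0.5):
    """Angle between two binary peak vectors (assumed form, not the paper's).

    Peaks whose m/z values differ by at most `tol` are counted as shared.
    """
    a = sorted(peaks_a)
    b = sorted(peaks_b)
    if not a or not b:
        return math.pi / 2  # nothing to compare: treat as maximally distant
    i = j = shared = 0
    # Two-pointer sweep: each peak is paired with at most one partner.
    while i < len(a) and j < len(b):
        diff = a[i] - b[j]
        if abs(diff) <= tol:
            shared += 1
            i += 1
            j += 1
        elif diff < 0:
            i += 1
        else:
            j += 1
    cosine = shared / math.sqrt(len(a) * len(b))
    return math.acos(min(1.0, cosine))


def coarse_filter(query_peaks, query_mass, database, radius,
                  mass_tol=2.0, tol=0.5):
    """Return ids of entries that pass both coarse constraints.

    `database` is an iterable of (id, precursor_mass, peaks) tuples.
    A linear scan is used here in place of a metric space index.
    """
    hits = []
    for pep_id, mass, peaks in database:
        # Precursor mass constraint: cheap check applied first.
        if abs(mass - query_mass) > mass_tol:
            continue
        # Fuzzy cosine term: keep only spectra within the query radius.
        if fuzzy_cosine_distance(query_peaks, peaks, tol) <= radius:
            hits.append(pep_id)
    return hits
```

The surviving candidates would then be passed to a fine ranking scheme. Tolerance-based peak matching is not transitive, so a distance of this kind can violate the triangle inequality, which is one reason such a measure would be called a semi-metric.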

Similar resources

Two-phase Filtering Strategy for Efficient Peptide Identification from Mass Spectrometry.

Peptide identification by tandem mass spectrometry (MS/MS) is one of the most important problems in proteomics. Recent advances in high-throughput MS/MS experiments result in a huge number of spectra, and the peptide identification process should keep pace. In this paper, we strive to achieve high accuracy and efficiency for peptide identification in the presence of noise by a two-phase filteri...

PepSOM: an algorithm for peptide identification by tandem mass spectrometry based on SOM.

Peptide identification by tandem mass spectrometry is both an important and challenging problem in proteomics. At present, a huge amount of spectrum data is generated by high-throughput mass spectrometers at a very fast pace, but algorithms to analyze these spectra are either too slow, not accurate enough, or only give partial sequences or sequence tags. In this paper, we emphasize the balan...

FAST ATOM BOMBARDMENT MASS SPECTROMETRY (FABMS) ANALYSIS OF AN N-TERMINAL-BLOCKED PEPTIDE

FABMS analysis of T-1b peptide before and after one cycle of Edman degradation indicated an unblocked N-terminal Thr residue for this tryptic peptide. In contrast, our data showed a molecular protonated ion, MH+, for T-1a peptide at 655 mass units (mu), which is 42 mu higher than the MH+ ion of T-1b peptide. In addition, T-1a peptide was not amenable to one cycle of manual Edman degrada...

Impact of Pharmaceutical Impurities in Ecstasy Tablets: Gas Chromatography-Mass Spectrometry Study

In this study, a simple and reliable method based on gas chromatography–mass spectrometry (GC–MS) was developed for the fast and regular identification of 3,4-MDMA impurities in ecstasy tablets. To this end, 8 samples of impurities were extracted with diethyl ether under alkaline conditions and then analyzed by GC–MS. The results revealed high MDMA levels ranging from 37.6% to 57.7%. The GC–MS method sh...

Journal:
  • Bioinformatics

Volume 22, Issue 12

Pages -

Publication date 2006